Robustness of Phoneme Classification Using Support Vector Machines: a Comparison between Plp and Acoustic Waveform Representations
نویسندگان
چکیده
Robustness of phoneme recognition to additive noise is investigated for PLP and acoustic waveform representations of speech using support vector machines (SVMs) combined via error-correcting code methods. While recognition in the PLP domain attains superb accuracy on clean data, it is significantly affected by mismatch between training and testing noise levels. The classification in the high-dimensional acoustic waveform domain, on the other hand, is more robust to additive noise. Moreover, these classifiers perform best when trained on clean data. We also show that the simpler structure of the waveform representation allows one to improve performance using custom-designed kernel functions.
منابع مشابه
Tuning support vector machines for robust phoneme classification with acoustic waveforms
This work focuses on the robustness of phoneme classification to additive noise in the acoustic waveform domain using support vector machines (SVMs). We address the issue of designing kernels for acoustic waveforms which imitate the state-ofthe-art representations such as PLP and MFCC and are tuned to the physical properties of speech. For comparison, classification results in the PLP represent...
متن کاملRobustness of Phoneme Classification Using Generative Classifiers: Comparison of the Acoustic Waveform and Plp Representations
The robustness of classification of isolated phoneme segments using generative classifiers is investigated for the acoustic waveform and PLP speech representations. Probabilistic PCA is used to fit a density to each phoneme class followed by maximum likelihood classification. The results show that although PLP performs best in quiet conditions, as the SNR decreases below 0dB acoustic waveforms ...
متن کاملPhoneme Classification in Linear Feature Domains
Phoneme classification is investigated for linear feature domains with the aim of improving robustness to additive noise. In linear feature domains noise adaptation is exact, potentially leading to more accurate classification than representations involving non-linear processing and dimensionality reduction. A generative framework is developed for isolated phoneme classification using linear fe...
متن کاملSpeech Recognition Front End Without Information Loss
Phoneme classification is investigated for linear feature domains with the aim of improving robustness to additive noise. In linear feature domains noise adaptation is exact, potentially leading to more accurate classification than representations involving non-linear processing and dimensionality reduction. A generative framework is developed for isolated phoneme classification using linear fe...
متن کاملImproving the Robustness of Phoneme Classification Using Hybrid Features
This work is concerned with improving the robustness of phoneme classification to additive noise with hybrid features using support vector machines (SVMs). In particular, the cepstral features are combined with local energy features of acoustic waveform segments to form a hybrid representation. The local energy features are taken into account separately in the SVM kernel, and a simple subtracti...
متن کامل